NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

MoVer: Motion Verification for Motion Graphics Animations

https://doi.org/10.1145/3731209

Ma, Jiaju; Agrawala, Maneesh (August 2025, ACM Transactions on Graphics)

While large vision-language models can generate motion graphics animations from text prompts, they regularly fail to include all spatio-temporal properties described in the prompt. We introduce MoVer, a motion verification DSL based on first-order logic that can check spatio-temporal properties of a motion graphics animation. We identify a general set of such properties that people commonly use to describe animations (e.g., the direction and timing of motions, the relative positioning of objects, etc.). We implement these properties as predicates in MoVer and provide an execution engine that can apply a MoVer program to any input SVG-based motion graphics animation. We then demonstrate how MoVer can be used in an LLM-based synthesis and verification pipeline for iteratively refining motion graphics animations. Given a text prompt, our pipeline synthesizes a motion graphics animation and a corresponding MoVer program. Executing the verification program on the animation yields a report of the predicates that failed and the report can be automatically fed back to LLM to iteratively correct the animation. To evaluate our pipeline, we build a synthetic dataset of 5600 text prompts paired with ground truth MoVer verification programs. We find that while our LLM-based pipeline is able to automatically generate a correct motion graphics animation for 58.8% of the test prompts without any iteration, this number raises to 93.6% with up to 50 correction iterations. Our code and dataset are at https://mover-dsl.github.io.
more » « less
Free, publicly-accessible full text available August 1, 2026
Bridging the Gulf of Envisioning: Cognitive Challenges in Prompt Based Interactions with LLMs

https://doi.org/10.1145/3613904.3642754

Subramonyam, Hari; Pea, Roy; Pondoc, Christopher; Agrawala, Maneesh; Seifert, Colleen (May 2024, ACM)

Full Text Available
Editing Motion Graphics Video via Motion Vectorization and Transformation

https://doi.org/10.1145/3618316

Zhang, Sharon; Ma, Jiaju; Wu, Jiajun; Ritchie, Daniel; Agrawala, Maneesh (December 2023, ACM Transactions on Graphics)

Motion graphics videos are widely used in Web design, digital advertising, animated logos and film title sequences, to capture a viewer's attention. But editing such video is challenging because the video provides a low-level sequence of pixels and frames rather than higher-level structure such as the objects in the video with their corresponding motions and occlusions. We present amotion vectorizationpipeline for converting motion graphics video into an SVG motion program that provides such structure. The resulting SVG program can be rendered using any SVG renderer (e.g. most Web browsers) and edited using any SVG editor. We also introduce aprogram transformationAPI that facilitates editing of a SVG motion program to create variations that adjust the timing, motions and/or appearances of objects. We show how the API can be used to create a variety of effects including retiming object motion to match a music beat, adding motion textures to objects, and collision preserving appearance changes.
more » « less
Full Text Available
Tree-Structured Shading Decomposition

https://doi.org/10.1109/ICCV51070.2023.00051

Geng, Chen; Yu, Hong-Xing; Zhang, Sharon; Agrawala, Maneesh; Wu, Jiajun (October 2023, IEEE/CVF International Conference on Computer Vision (ICCV))

Full Text Available
Towards Understanding How Readers Integrate Charts and Captions: A Case Study with Line Charts

https://doi.org/10.1145/3411764.3445443

Kim, Dae Hyun; Setlur, Vidya; Agrawala, Maneesh (April 2021, Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems)
null (Ed.)
Charts often contain visually prominent features that draw attention to aspects of the data and include text captions that emphasize aspects of the data. Through a crowdsourced study, we explore how readers gather takeaways when considering charts and captions together. We first ask participants to mark visually prominent regions in a set of line charts. We then generate text captions based on the prominent features and ask participants to report their takeaways after observing chart-caption pairs. We find that when both the chart and caption describe a high-prominence feature, readers treat the doubly emphasized high-prominence feature as the takeaway; when the caption describes a low-prominence chart feature, readers rely on the chart and report a higher-prominence feature as the takeaway. We also find that external information that provides context, helps further convey the caption’s message to the reader. We use these findings to provide guidelines for authoring effective chart-caption pairs.
more » « less
Full Text Available
Searching the Visual Style and Structure of D3 Visualizations

https://doi.org/10.1109/TVCG.2019.2934431

Hoque, Enamul; Agrawala, Maneesh (January 2020, IEEE Transactions on Visualization and Computer Graphics)

Full Text Available
Answering Questions about Charts and Generating Visual Explanations

https://doi.org/10.1145/3313831.3376467

Kim, Dae Hyun; Hoque, Enamul; Agrawala, Maneesh (April 2020, Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems)

Full Text Available
Analysis of Faces in a Decade of US Cable TV News

https://doi.org/10.1145/3447548.3467134

Hong, James; Crichton, Will; Zhang, Haotian; Fu, Daniel Y.; Ritchie, Jacob; Barenholtz, Jeremy; Hannel, Ben; Yao, Xinwei; Murray, Michaela; Moriba, Geraldine; et al (August 2021, KDD '21: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining)

Full Text Available
Converting Basic D3 Charts into Reusable Style Templates

https://doi.org/10.1109/TVCG.2017.2659744

Harper, Jonathan; Agrawala, Maneesh (March 2018, IEEE Transactions on Visualization and Computer Graphics)

Full Text Available
Facilitating Document Reading by Linking Text and Tables

https://doi.org/10.1145/3242587.3242617

Kim, Dae Hyun; Hoque, Enamul; Kim, Juho; Agrawala, Maneesh (January 2018, Proceedings of the 31st Annual ACM Symposium on User Interface Software and Technology)

Document authors commonly use tables to support arguments presented in the text. But, because tables are usually separate from the main body text, readers must split their attention between different parts of the document. We present an interactive document reader that automatically links document text with corresponding table cells. Readers can select a sentence (or tables cells) and our reader highlights the relevant table cells (or sentences). We provide an automatic pipeline for extracting such references between sentence text and table cells for existing PDF documents that combines structural analysis of tables with natural language processing and rule-based matching. On a test corpus of 330 (sentence, table) pairs, our pipeline correctly extracts 48.8% of the references. An additional 30.5% contain only false negatives (FN) errors -- the reference is missing table cells. The remaining 20.7% contain false positives (FP) errors -- the reference includes extraneous table cells and could therefore mislead readers. A user study finds that despite such errors, our interactive document reader helps readers match sentences with corresponding table cells more accurately and quickly than a baseline document reader.
more » « less
Full Text Available

Search for: All records